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Minority games where groups of agents remember, react or incorporate information with different 
timescales are investigated. We support our findings by analytical arguments whenever possible. 
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C/5 I INTRODUCTION 

(N ; 

Heterogeneity is gradually being recognized as one of the most important ingredients for the modeling of financial 
__' markets. Among the many types of heterogeneities, timescales are difficult to understand analytically, because they 
. increase usually much the complexity of the equations to solve. However, as argued very early [jj], financial market 
Oh' participants have very discernably different timescales, which deserve therefore a detailled study. Among recent works 
(j . on the topic, two models of stochastic volatility have explicitely included an infinite number of timescales 0,0. 

Here we investigate several types of timescale heterogeneities in the Minority Game model (MG) framework: strategy 
change frequency, strategy-strategy correlation, reaction rates and score memory. We are able to provide analytical 
support for our finding in the last three cases. Previous work addressed the effects of trading frequency 0, 0] 
• i-h . analytically. 
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CANONICAL MG 



I ' The minority game is easily defined: at each time step, all the players have to choose between two alternatives; 
those who happen to be in the minority win. This is variant of Arthur's El Farol bar 6] problem where the resource 
level is set to satisfy half of the people. The Minority Game is exactly solvable which makes it an ideal model to 
I""""- study and understand various aspects of the dynamics of competition. 

Mathematically, agent i = 1, • • • ,N takes action a,i(t) 6 {— 1, +i} at each time step and receives a payoff —ai(t)A(t) 
where A(t) is the aggregated outcome A(t) = Yli=i a »(0- The agents base their decisions on public information which 
' is encoded in an integer number n(t) drawn uniformly from 1, • • • , P. In order to process this information they are 
endowed with S strategies, which are fixed maps, or look-up tables, from the current public information /i(t) to an 
action. At time t, agent i decides to trust his best strategy Si(t) — argmax s= i ! ... ! s yi^ s (t), where yi, s (t) is the score of 
O . strategy s of agent i, which evolves according to 

>>: m* +i) =*,.(*) a) 
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Strategy change frequency 



The above definition of the MG assumes implicitely that all the agents may change the strategy that they use 
(si(t)) at each time step. Variants of the game where all the agents update synchronously their Si(t) every T time 
steps have been studied in the literature pj H, El EH, and are exactly solvable in the limit T — > oo, in which case 
they are called batch games. This part of the paper separates the populations into two groups. Fast agents behave as 
usual, whereas slow agents update their strategies synchronously every T time steps. This introduces a heterogeneity 
of time scales. I n essence, it is similar to giving a longer history memory in games where the state fi{t) is the binary 
encoding of the last M winning choice, also known as games with real histories. Giving a larger M to a set of agents 
is already found in the very first paper on the MG, and was analyzed further in |llLll2| . Such agents do surprisingly 
worse than their colleagues as long as the system is not deep into the symmetric phase, which is characterized by 
alternating winning sides; Metzler showed that agents with a larger M need a large alternating probability in 
order to be able to profit from it. The case we study here is much simpler as it does not require real histories and is a 
priori more suitable to mathematical understanding. Unfortunately, since the exact analytical solution does not exist 
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FIG. 1: Variance a 2 /N (top) and predictability H/N (bottom) as a function of a for </>=0.7 and several values of T. Simulations 
with P — 64, averages over 200 samples 



for games where all the agents update synchronously their strategies every T time steps, we cannot solve the mixed 
case either and must resort to numerical simulations. 

The relative composition of the population is tuned by a parameter (/>: if N is the total number of agents, <j)N of 
them are ordinary Minority Game agents, i.e., fast, whereas (1 — 4>)N update their strategy choice variable Si{t) every 
T time steps, that is, their scores evolve following 

t 

y?(t + 1) = y?(t) - J2 a tP A (t') if t MOD T = 

t'=t-T+l 

yl(t+l) = yt(t) otherwise. (2) 
As usual, we shall focus on the predictability 

H = ^(A\ V ) 2 (3) 

where (A\v) is the temporal average of A conditional to /i(t) = v. If H > 0, knowing /i makes it possible to predict 
statistically the next outcome. H measures the amount of information left by the agents in the game. The fluctuations 
<7 2 = (A 2 ) play a special role as they quantify the quality of resource sharing achieved by the population, which is 
usually benchmarked against the fluctuations produced by random choice a 2 /N = 1. It is easy to see indeed that the 
a 2 is nothing else than the average total loss per time step of the population. Of particular interest in the case of 
competing populations are their respective average gains per time step. 

Figure reports the behaviour of H and a 2 as a function of a — PJN, for a fixed <fi and different values of T. In 
the predictable, asymmetric phase (H > 0), the slow agents do not change the unique stationary state; as a result, 
H does not depend on T, but a 2 is slightly lowered as T increases, because strategy switching occurs less often. On 
the other hand, in the unpredictable, symmetric phase (H = 0), multiple stationary states exist as a result of broken 
ergodicity and any modification to the system will accordingly change the final values of a. This is the case 

here: the introduction of a small amount of slow agents reduces the total amount of fluctuations because they damp 
the overreaction of the fast players without contributing too much to the global outcome A. 

What is more surprising is the behaviour of a 2 in the symmetric phase when one varies 4>, as shown in Fig. [2] 
at fixed a — 0.2, slow agents {(f) — 0) produce larger fluctuations than fast ones {4> — I); increasing the fraction of 
fast agents (f> decreases the fluctuations which reaches a minimum below I, the random choice benchmark, and then 
increase again to reach the standard MG value, slightly above I. This means that the two groups live in symbiosis, 
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FIG. 2: Volatility a 2 /N as a function of <?(> for a = 0.2 fixed (P = 32, N = 160). 



and that there is a non-trivial optimal composition of the population. Other known examples of symbiosis in MGs 
include speculators and producers \Ti\ . 

In such cases, it is natural to characterize the information ecology of the model (see [2 13); that is, who exploits 
who. To this end, Fig. © reports the average gain per time step of fast agents -ff, which decreases monotonically as 
their concentration increases, but stays roughly constant as long as 4> < 0.4. In this region, the slow agents provide 
information that the fast agents exploit. The losses of the latter are greatly reduced compared to 4> = lj but they 
seem not to be able to achieve positive gain on average. Similarly, when the slow agents are few [cf> > 0.8), they profit 
from overreacting fast agents. 
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FIG. 3: Average gain of fast (left) and slow (right) agents as a function of <j> for several T. Simulations with N = 300, P = 64, 
averages over 200 samples 



Strategy correlation 

In the standard MG, all the af s are random variables, drawn completely independently from each other. Take an 
agent with two strategies; they will stipulate the same action a fraction c = 1/2 of the //s on average. In other words, 
the standard-MG agents behave in the same way irrespectively on their strategy choice for half of the market states 
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u.|25| MG with tunable c where introduced in Ref. 0] an d also studied later in Refs 0,^1- The parameter c induces 
a reaction time scale ~ 1/(1 — c): the smaller c, the more adaptive an agent is; on the other end, people with c = 1 
are not adaptive: they inject predictability and are ideal candidates for exploitation. The latter were introduced as 
producers, that is, people who do not care much about timing in market, but use the market as a tool for exchanging 
goods 0|; producers (c = 1) and speculators (c < 1) live in symbiosis 0, that is, the gain of a given group increases 
then the other group is also present. 

Here, we consider the case where the two groups s and / have < c g < 1, g = s, f. The apparent similarity of this 
setup with the case previously studied is deceptive, as we shall see. The strategies are drawn according to 
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H,2J 



c s . 



(4) 
(5) 
(6) 



When Cf < c s , group / is the fast one, and group s is the slow one. The asymmetric phase of the model for an 
arbitrary number of groups is exactly solvable in the limit N — > oo with replica trick [111 Il4. [ljj ■ The solution gives 



H 

N 



(j>c } + (1 - (f>)c s + 0(1 - c f )Q f + (1 - 4>){1 - c s )Q s 



(i + 0x/ + (i-0)x s ) 2 (7) 

where Q g = ^Ziejv" ( S «W) 2 (-9 = /j s ); Xf and \s are the integrated response functions 0. These four quantities all 
depend on two variables Q and and £ s which are the solutions of two coupled non-linear equations (see appendix). 
Eq. Q) shows that in this case the stationary state of the asymmetric phase depends on the composition of the 
population, which is also true of the location of the critical point. 

The respective gains of the two groups can also be computed exactly. Starting from the total losses 



(y H _ 
— = — h Ml 

N N VK 



c/)(l-Q/) + (l-^)(l-c s )(l-Q s ) 



(8) 



and observing that if a\ = X)u=i((^fast + XA s \ ow ) 2 \fi) / P, the gains of the agents are nothing else than 



7s 
7/ 
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A=l 



(9) 
(10) 



From the replica calculus, we find 



C s + (1 - C s )Qs 

i + to/ + (i - <t>)x s 



(l-c s )(l-Q s ); 



(11) 



7/ _ 1 cr 2 1 - <t> 7s 

Nf 4> N 4> N s ' 1 ' 

The behaviour of the game is non trivial, which is already clear in Fig 01 where the fluctuations, or average losses 
of the players per time-step, are plotted agains c s , showing a maximum. In this kind of plot, one should be careful 
to stay in the asymmetric phase, because the critical point a c depends on c s , c/, and <fr. Taking a > 1 solves this 
problem since a c < 1. The minimum of a 2 /N is surprising at first. Figure [5] gives a deeper understanding of this 
peculiar phenomenon by plotting the gains of the two groups. The minimum of the losses can be attributed entirely 
the slow agents only, who profit quite remarkably from the fast agents (c/ = 0) unless they are very slow. This shows 
an competition between two effects: being slower means that one overreact less, in particular with respect to local 
fluctuations; inversely, being too slow makes it too difficult to react to being exploited. Interestingly, increasing c s 
increases monotonically the gains of group /. 

The relative fraction of each type of agent is a crucial parameter, as illustrated by Fig0 in particular when one 
group of agents has a large strategy correlation. In order to shed more light on the matter, we produced plots of the 
regions where one group has an advantage over the other. When <fi = 0.5, Figure El contains four regions of interest. 
When c g < c* ~ 0.61, group g on average wins more than the other group g' as long as c g > c' g , which means that 
as long as both groups have sufficiently low cs, the slower the better. On the other hand, when c g > c^m) ~ 0.98, 
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FIG. 4: Fluctuations a 2 /N versus the strategy correlation parameter c s of the slow agents 
P = 64, a = 2, averages over 200 samples. 



= 0.5, Cf — 0). Simulations with 
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FIG. 5: The average gains gj /Nj , g a /N 3 for fast and slow agents respectively (<j> = 0.5, c/ = 0) . Lines are theoretical predictions 
from Eq JTOJ. Simulations with JV = 32, P = 64, averages over 200 samples 
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FIG. 6: Regions of relative advantage for cj> = <j>f = 0.5 (left graph) and (j> = 0.01 (right graph); the region filled corresponds to 
the symmetric phase, in which the replica calculus is not valid. 
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group g' always exploits group g. For intermediate values of c g , the outcome depends on the precise value of both c g 
and c' g . While changing <f> leaves unaffected the diagonal boundary, it has two remarkable effects: first it changes the 
non-linear boundary between the 7/ < 7^ and 7^ < "f s regions by roughly rotating it clockwise. A new region also 
appears for a and 4> small enough where the gain of group in relative sparseness is positive; it is however very small, 
and only appears when the largest group has a very large c. In this case, one needs to be few and react fast in order 
to be able to exploit very slow agents so much that one's gain is positive. 



GAMES WITH NO PUBLIC INFORMATION 



The case where P = 1, that is, when there is no public information available corresponds to the limit a = of the 
previous section, and is particularly simple to understand analytically [Til Il9| ; as a result explicit formulae for the 
fluctuations and gains can be obtained. Given A(t), each agent i receives a payoff —a,i(t)A(t), which is stored in the 
score 

A i (t + l) = A < (i)-^ (13) 



and 



P[a,ft + l) = l] = 1 + tanh(rAW) . (14) 



Ti is a reaction rate: it specifies the difference of behaviour to a change of A^. Because of Eq I1KSI) it is also timescale. 
Linking the above model with financial markets is straighforward if one assumes that the log-price evolves according 

to 

p(t+l)=p(t) + ^. (15) 

This allows us to propose a new interpretation of this case: rewriting Ai(t) = — Y^t'=o A(t)/N + A^O) where Ai(0) 
is the initial condition of agent i, one sees immediately that 

Ai(f) = -p(f) + Ai(0). (16) 

Aj(0) is nothing else than the asset reference price, or value, of agent i. Hence, this equation describes a model of N 
investors having each a value in mind for the price, and acting accordingly. j2|| The agents are therefore fundamentalists 
[l{| who compare the current price with a reference price <f>i, and Ti tunes the price excursion from its supposed 
fundamental value Aj(0) tolerated by agent i. 

If Aj(0) = 0, all the A^s are the same and can be replaced by A. Eq (|13fl becomes 

A(i + 1) = A(i) - tanh(rA(») + n(t) (17) 

where n(t) is a noise term with zero average (n{t)) = and (n(t)n(t')) = 6 tjt >[l - tanh(rA(t)) 2 ]/iV; it vanishes 
therefore in the N — > 00 limit. It is easy to find that the fixed point A^ ^ = is stable if T < T* =2 and (A 2 ) oc N, 
and unstable otherwise; in the latter case, a period 2 dynamics emerges, with the stable points determined by [Ls| 

A (1) = tanh(rA^^)/2. (18) 



and (A 2 ) cx N 2 . A Taylor expansion of Eq (0 gives Ai ~ ±J 3( p 3 2) for T close to 2; on the other hand, ^ - Ai oc 

exp(— r) for large T. A way to check numerically the value of T* is to observe the onset of the change of (A 2 ) from 
0(N~ 1 ) to O(l) as a function of T. Heterogeneous initial conditions A;(0) 7^ help to stabilize the fixed point by 
raising T* Q. 

We shall be particularly interested in the gains of the agents. The knowledge of -P(A) allows us to compute the 
average gain (7) = —(A 2 )/N, i.e. the fluctuations themselves 



N N . 



,S '~ 1 J dAP(A)(A 2 |A) = iy dAP(A)[((M) 2 |A) + (A|A) 2 ] (19) 
= 1 + (N - 1) J dAP(A) tanh(rA) 2 (20) 
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FIG. 7: Fluctuations versus V from numerical simulations (thick line; N = 100, 10 6 iterations per point), from the Fokker-Plank 
equation (thin line, F < 2), the Gaussian approximation (dashed red line), and Eq 1251 1 (dash-dotted line, V > 2) 



If r > r* and N — > oo, we can simply write 



~N~ 2 



4A 2 



(21) 



which is about 12 ^ 3 ^ for F close to 2. For T < 2, one must first keep N finite and derive P(A). The dynamical 
equation for A can be rewritten as a Fokker-Planck equation, which reads 



dP(A) d 2 



l-(N-l) tanh(rA)' 



dt 2dA 2 
Solving this equation in the stationary states gives 

1 



N 



P(A) 



_d_ 

OA 



[tanh(rA)P(A)] 



P(A) 



-(i+r) 



[2 + iV(cosh(2rA) - 1))] — — cosh(rA) z 



(22) 



(23) 



where Z is the normalisation factor. When T is small, P can be approximated by a Gaussian with zero average and 
(A 2 ) = l/[2r[(iV(r + 1) - T]. Therefore 



{A 2 



1 + {N-1)- 



1 + 



1 



r 

2 



n ->oo, r< i 



(24) 



TV y 2[iv(r + i) -r] ~ ' 2(r + i) 

Figure shows that the Fokker-Planck equation provides a good description of the stationary state for T < 1, whereas 
for larger T the hypothesis of small jumps in A is clearly wrong; this is due to the fact that with T > 1, the drift 
term makes A change sign on average at each time step. The Gaussian approximation bends in the wrong way as T 
increases and should not be used for T > 0.01. For T > 2, -P(A) separates into two symmetric peaks, centered roughly 
at ±Ai; it can therefore be approximated by 

P{A) ~ -^[2 + 7V(cosh(2r(A-Ai)-l))]^T* n cosh(r(A-Ai)) 2 



+ 



2Z 

-5- [2 + 7V(cosh(2r(A + Ax)) - l))]^ 11 cosh(r(A + Ai)) 2 
Z2j 



(25) 



Heterogeneous learning rates 



Let us consider G groups of respectively (j) g N agents, g = 1, • • • , G; all the agents belonging to group g have r g . 
Using the notation f(x g ) =J2q 4>gf( x g) f° r an y function / of variables x g , the dynamical equation for A reads now 



A(t + 1) = A(t) - tanh[r g A(t)] + r)(t) 



(26) 
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FIG. 8: Average gain versus Tf from numerical simulations (circles: fast agents with Tf, slow agents with F 3 = 0.001) and Eq 
I3H (dashed lines); average over 100000 time steps, after 10/F S time steps, average over 20 samples. 



where rj g (t) are Gaussian noises with zero average and variance (r](t)r](t')) = $t,? ~ tan j^ rgA ) , Linear stability shows 
that A^ ^ is a stable fixed point as long as 



T g <2. 



When this point is unstable, it replaced by a period-two dynamics with stable point 



(1) = tanh(r g A(D) 
2 

Going through the same procedure as before, one finds that P(A) is determined by 



dP _ -2iVtanh(r g A)[l + r fl (l - tanh(r 9 A) 2 ] - 2r g tanh(r g A)[l - tanh(r 5 A) 2 ] 
P 7Vtanh(r g A) 2 + 1 - tanh(r g A) 2 



(27) 



(28) 



(29) 



This cannot be integrated any more. The Gaussian approximation consists in keeping only the terms linear in A in 
this equation, and results in A being of average and variance (A 2 ) = — = — == — =- for small T. 

The average gain per player of group g at fixed A is equal to 



(7,|A>_ (A g \A) + ^ g (A g A' g \A) 



-l-(Ng-l) tanh(r 9 A) 2 - tanh(r g A) ^ N' g tanh(T^A) (30) 

g' 



Assuming that the Gaussian approximation is valid, i.e. if all the T g are small, 



N n 



g 



2[i + r 3 ] 

for infinite N . The gain of an agent of group g can be compared to the average gain, which yieds 

~g-^g 



(7 9 > ~ (lg) 



2(1 + r s 



(31) 



(32) 



In other words, the smaller T g , the smaller the losses of that group. This is intuitive: at a given time, i.e. at a given 
A, the fraction of players whose action is opposite to the sign of A is larger for smaller T. 

Figure |H1 compares (7) from numerical simulations and from Eq (|31|l . As before, the approximations made are valid 
for small T. This figure also indicates that the transition at T* is smooth. 
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FIG. 9: Fluctuations a 2 /N - 1 versus A for V — 0.1, 0.5, 0.7 and 0.9 (bottom to top) obtained by numerical simulations. 
N = 100, 10 6 iterations per point; 



Heterogenous score memory A 

Another way of having heterogeneous time scales is to differ in one's score memory. Before studying groups, let us 
again characterise first the homogeneous case. For the sake of simplicity, we consider exponential moving averages. 
There are two ways of implementing them: 

A(t + l) = A(t)(l-A)-^ (33) 

and 

A'(t + l) = A'(t)(l-A)-A^ (34) 

The length of score memory is l/|ln(l — A)| oc 1/A for small A. One can transform Eq 133fl into 134|) by setting 
A' = A A and V = T/X. The same stability analysis as above gives for a homogeneous population 

T + A<2 (35) 

and 

A(r' + 1) < 2 (36) 

Increasing A at fixed T and V has therefore opposite effects for the two kinds of dynamics: a small A stabilises Eq 
(|33H and destabilises Eq l|34l) . Since there is a one-to-one correspondance between all the results of Eqs (|33l) and 134[) . 
we shall focus on Eq l|33l) . The period-two fixed point for T + A > 2 is now determined by A^ = tanh(rA^^). 
Assuming that A^°) = is stable, the same procedure as before yields 

dP _ [AA + tanh(rA)](l + A + r[l-tanh(rA)]) + rtanh(rA)[l-tanh(rA) 2 ] 

~~P~~ A(AA + tanh(rA)) 2 + 1 - tanh(rA) 2 ( ' 

In the Gaussian approximation, one finds 

^ A ^ = 2[(r + A)(l + A + T)N — r 2 ] ' ^ 38 ^ 

As consequence, for small T, A > decreases the fluctuations. The Fokker-Planck equation holds if A + Y < 1; 
otherwise, the change of sign of A at each time step (neglecting the noise) causes more fluctuations. Therefore, a 2 /N 
has a minimum at A + T = 1 when A increases, as illustrated by Fig[!|| The Gaussian approximation is very good for 
small r and A. 
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Generalising these results to groups of agents is more complicated than for heterogeneous T because of the role of 
A in Eq. (|33|) : each group g with a given A g has its own A g 



A g (t + 1) = A g (t)(l - X g ) - tanh(rA 9 + v (t) = f g ({A g ,}) + r,(t) (39) 



where {rj{t)ri{t')) = <5tt'(l — tanh(rA g <) 2 )/./V. For infinite N, the linear stability conditions of the fixed point A g = 
for all g are J2 gl \df g /dA g i \ < 1, that is, 

|1 - X g - g T g \ + ]T <t> g , tanh(r fl ,A 9 < 1 (40) 
g'j=g 

for all g. When X g + 4>g^g < 1; i- e - both A g and T are small, the stability condition is T g > < X g + 2cf> g T g ] for instance 
if r g = r for all g, and <fi g — 1/G, then the condition is T(l — 2/G) < A g . The other case is obtained when T g ~ l/0 g 
and gives A g + T g i < 2, which is the same as before. Interestingly, some unstable A g can coexist with stable ones, 
for instance when r s = T = 2 — e, < e C 1, 4> = 1/G and X g = 2eg/G, where the dynamics of A g is stable for 
g < G/2 and unstable otherwise. This is clearly a source of losses for fast-forgetting players. Going back to the 
average gain per player per time step, ones indeed that the gain of group g is intimately related to the variance of 
A 9 . The multivariate Fokker-Planck equation reads 



dP{A,t) _ y, d 



dt ^ dA„ 

9 J 



([X g A g + tanh(rA 9 ,)]P) + \ ^ J^S D ^' P 

g,g' 9 



(41) 



with 



1 - tanh(r>Ag») 



2 



D 9,9> = N + ( x 9&g + tanh(i>A fl „))(A s <A g , + tanh(r g » A g „)) (42) 

Solving a linearised version of Eq (|41|l is done following standard procedure [2(| and gives a multivariate Gaussian 
solution. For G = 2, we have the resulting equations. 



P(A '' As) = o X R 2 eXp 



2pA f A s 




(43) 



2(1 - p 2 ) y aj a f a s 

The expressions for Uf,a s and p are too long to be reported here j^- Reusing Eq l|30|) the respective average gain is 

( 7/ ) ~ -l-r 2 ((A})(N f -l)+N s (A f A s )) 

= -l-T 2 f a}(N f -l)-T f T s N s pa f a s ] (44) 

Figure ^| plots the gains of the two groups and clearly shows that all other things beeing equal, having a shorter 
memory is an advantage. Indeed, when T/ = T s = T and <fif = <fi s = 1/2, as in Fig. (|1U|I . Eq (|44ll leads to 

8r 2 (A / -A s )(l+r + A / + A s )[A / (l + A / + £) + A s (l + A s + ^) + A / A s ] 

(7/) ~ [Is) 



[2X f X s + T(A/ + A s )][4r 2 + 2(2 + 2A/ + A s )(2 + X f + 2A S ) + 3r(4 + 3A/ + 3A S 

1 



2r 2 + r(2 + 3A/ + 3A S ) + 2(A/ + X) + X s + A 2 + A/A s ) 



(45) 



which is of course positive when Xf > X s . The above sections suggest that such advantage is menaced by increasing 
Tf. Interestingly, increasing <pf increases the gains of both groups, as the fast agents suffer less from the fluctuations 
caused by their slow colleagues. In the unstable region, the effect is the opposite, that is, fast forgetting agents fail 
to smooth out sufficiently slowly large fluctuations, and suffer from larger losses than slower agents. We performed 
similar numerical simulations for P > 1 and Y = oo (standard MG), and found out similar results: faster agents end 
up earning less in both phases (we could not find an opposite result), which can be interpreted by their tendency to 
switch more often between their strategies. 
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CONCLUSIONS AND REMARKS 



The four types of timescale heterogeneities investigated point to compatible and broad conclusions. First, agents 
with a smaller strategy change frequency are similar to agents with larger strategy-strategy correlation, except that 
their presence does not change the onset of the critical point, nor H . Accordingly, the gain of fast/slow agents in both 
cases show similar behaviour. Although Ref. 4] did not compute the average gain as a function of playing frequency 
it did show however that agents that trade less often tend to stick more to one strategy in the asymmetric phase; 
since frozen agents have a higher average payoff, this suggests agent who play less often win more. We confirmc this 
intuition by extensive numerical simulations, and checked that this conclusion does not depend on the concentration 
of slower agents. However, the results is the opposite in the symmetric phase where slower agents are exploited, while 
all the agents have an equal gain at the critical point. Overreacting, that is, having a larger reaction rate than the 
average population is detrimental. Reference |5j did not study the gain of agents with heterogeneous reaction rates, 
therefore we refrain to generalize this conclusion to P > 1. Finally, for P = 1, agents with a smaller finite score 
memory do generally better than average, unless they have a too large reaction rate. 

D.C thanks B. Hambly for technical help, and Wadham College for support. 
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[25] This is the origin of predictability in the MG |2l|. 

[26] Ref [22^ proceeded the other way around by making value investors with heterogeneous expectations on the fair value 
playing a delayed majority ga me I23I |24| and noticing that the fluctuations look like those produced by minority players; 
this was shown analytically in jl9l | 

[27] We provide however a Mathematica file at www.maths.ox.ac.uk/ challet 



REPLICA CALCULUS 



The calculus parallels mostly the established procedure [ill fl4| : the dynamics minimizes H 0] that is akin to 
an energy. The stationary state of the system corresponds therefore to the ground state of H . After some algebraic 
manipulations it is possible to find relations that link the quantities that we have introduced The predictability is 
given by 



H x _ 4>c f + A 2 (l - 4>)c s + 4>{l - c f )Qf + A 2 (l - (f>)(l - c s )Q s 



N (l + 0x/+A 2 (l-0)x,) 2 

where the integrated response functions \s and Xf are defined as 



Xf 



The self-overlap Q„ is equal to 



_(l_^)erffi| 



erf 



Q g = l 



erf 



Xs 



Xf erf 



v (g 



c fl v erf v^ 



All these quantities depend on Q and £/, determined through 



a(l- c f ) 



<j>c f + (1 - 4>) c s + <f> (1 - c f ) Qf + (1 - cf>) (1 - c s ) Q s 



k 

Cs 



- ' V" <? 



2V^/2 



+ (1-^)(1- C ,) 

y v 7r q 

Finally, the average gain of slow agents is 



7f_ 



1 19 a 2 



1 - 2 d\ N 
that leads to the expressions l|llfl . 



A = l 



1 Um 6HM 



1-0 p^oo d\ 



A=l 



1 ~Cf . 

1 — c s ' 



(1-C)(1-Q.) 



(46) 



(47) 



(48) 



(49) 



(50) 



(51) 



More than two groups 

The results above are readily generalised to G groups denoted by g = 1, . . . , G: group g comprises N g = <f) g N agents 
equipped with two strategics with correlation c g : 

H _ (c + (l- C )Q) (52) 



N (i+(x)r 

fr , ((i-c)(i-Q)) 



(53) 



N N N 

- *X a - C3+ ?-y Q3 -(l-c 9 )(l-Q g ) (54) 



A g iV AS 1 + ( X ) 



where the average (.) is over the groups. 

'-«'-<i/!4H-?M£))> + 

Cg_ = / l-C g 



